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Abstract 



In previous papers we have pointed out that there exists a QCD analog of the 
phenomenological concept of the so-called Fermi motion for the heavy quark inside a 
hadron. Here we show in a more detailed way how this comes about and wc analyze 
the limitations of this concept. Non-perturbative as well as pcrturbativc aspects are 
included. We emphasize both the similarities and the differences to the well-known 
treatment of deep inelastic lepton-nucleon scattering. We derive a model-independent 
lower bound on the kinetic energy of the heavy quark inside the hadron. 
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1 Introduction 



The theoretical and experimental study of deep inelastic lepton-nucleon scattering 
(hereafter referred to as DIS) was instrumental in developing QCD. Inclusive weak 
decays of heavy flavors - in particular semileptonic decays - are intimately related 
to DIS via channel crossing. It is then quite surprising to note that for a long time 
there were hardly any attempts to treat charm and beauty decays in QCD proper 
rather than within some phenomenological models. The flrst systematic attempt was 
undertaken in ref. some general observations were made [@] in the context of 
Heavy Quark Effective Theory (HQET) |^ . The real breakthrough occurred recently 
H, 1^ once a previous stumbling block had been removed 0. Through an expansion 
in inverse powers of the heavy quark mass tuq one has obtained a consistent treatment 
based on QCD. 

One flnds, not surprisingly, that the inclusive decays of heavy flavor hadrons are 
described by the decay of free heavy flavor quarks in the limit rriQ — * oo. (Generically 
we will denote the heavy quark by Q and the heavy flavor hadron by Hq). For 
flnite quark masses there arise pre-asymptotic non-perturbative corrections. A very 
obvious example is the following: the mass of Hq exceeds that of Q; therefore there 
is a kinematical regime in the Hq decay spectrum that cannot be populated in the 
free quark decays. At flrst sight there would seem to be a clear conflict between the 
observation and the l/rriQ expansion. Upon further reflection one realizes that it is 
the motion of the heavy quark Q inside the hadron Hq that will close this gap in the 
theoretical spectrum. In this paper we will give a more rigorous justiflcation of this 
intuitive picture, both in its perturbative as well as non-perturbative aspects. Our 
treatment of the momentum distribution of the heavy quark is based directly on QCD 
without recourse to phenomenological assumptions. We will introduce distribution 
functions that are universal in the sense that they will determine bound state effects 
in all inclusive transitions of a given heavy hadron. For instance, the very same 
function will deflne both the shape of the photon line in 5 — >■ 'fXg transition and the 
lepton energy spectrum near the end-point in the B — > WiX^ semileptonic decays. 

Historically Bjorken was the flrst to discuss the effects due to the heavy quark 
motion inside the heavy hadrons in the problem of the heavy quark fragmentation 
In application to the heavy hadron decays similar ideas were laid in the basis 
of the well-known AC^M^ model engineered as a simple non-relativistic model of 
the heavy quark motion (which was referred to as the 'Fermi motion'). This model 
is now extensively used in the analysis of the lepton energy spectrum in the semi- 
leptonic decays. Suffice it to mention that the published results on the semi-leptonic 
branching ratios in B mesons rely on this model to this or that extent. 

Our approach is quantitative and systematic. It reveals, in particular, that some 
aspects of the AC^M^ model can not be reconciled with QCD. The main observa- 
tion - that the QCD analog of a phenomenologically introduced Fermi motion of the 
heavy quark inside the heavy hadron exists in QCD - has been already stated and 
used to interpret the results in our previous paper on lepton spectra in inclusive heavy 
flavor decays Here we will present them in a more detailed and complete form. 



In the meantime papers |TT| have appeared treating these and related phenomena 



from close positions. In many instances, where they overlap with our analysis, there 
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is a general agreement, yet we go beyond these treatments. In presenting our ap- 
proach we focus, mainly for pedagogical reasons, on a simple example of the b ^ S'j 
transition. The results obtained, however, are of general validity. In particular, we 
adapt the general theory to include the case of the differential distributions in the 
semi-leptonic decays. 

The paper is organized as follows: in Sect. 2 we present a simplified analysis 
of 6 — s> s + 7; basic non-perturbative elements of our approach are discussed and a 
heavy quark distribution function is introduced. In Sect. 3 we treat radiative and 
semileptonic beauty decays in QCD assuming that the produced quark is massless. 
Sect. 4 is devoted to the case of the massive (non-relativistic) final quark. In Sect. 5 
we summarize the impact of radiative QCD corrections. In Sect. 6 we present a few 
comments on the literature. In Sect. 7 the lower hound on the kinetic energy of the 
heavy quark is obtained along with some speculations regarding the upper bound. 
Sect. 8 summarizes our results. 



2 A Toy Model for 6 ^ s + 7 

The phenomenon of the Fermi motion manifests itself in its purest form in 6 ^ s + 7 
transitions. For at the level of the free quark decay the photon energy is fixed by 
two-body kinematics to be 

Eio) . (1) 

where the strange quark mass has been neglected. At the same time in the 
inclusive decays of the B meson, B — > 7X5, there is a spread in the photon energy 
up to the kinematical boundary: 

Mr 

As mentioned in the Introduction, the motion of the b quark inside the B meson will 
smear out the infinitely narrow photon line in the free b quark decay. To obtain a 
quantitative treatment of this phenomenon we first 'peel off' all inessential complica- 
tions like the spin of the quarks and that of the photon; i.e. we consider a toy example 
where the heavy quark Q is a scalar and there is a real scalar field coupling the 
heavy quark field Q to the light scalar quark field q: 

= hQ(j)q + h.c. , (3) 

where h is the coupling constant, Q = Q"^ and the masses of the scalar field (p and the 
quark q are set to zero. Later on we will briefiy discuss the case ruq ^ 0. The field (f) 
carries color charge zero; the reaction Q ^ q + cp is thus a toy model for the radiative 
decays. It will allow us to focus on the non-pcrturbative effects; the conceptually 
quite relevant perturbative corrections will be addressed in Sect. 5. 

The total width for the free quark decay Q ^ q + (p is given by the following 
expression: 

r(g - q<P) = r/^ = Fo. (4) 
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The energy spectrum of in this hmiting case is an infinitely narrow fine, see Fig. 1, 

i^r.,.-!|^,. ,5) 

The strong interaction will smear this line over a finite interval of order 

k = MHQ-mQ. (6) 

and produce a picture of the type depicted in Fig. 2. (The interval of energies between 
vtiqII and will be referred to as window throughout this paper. The window 

and the adjacent domain below ^«q/2 whose size is ~ A will be generically called the 
end-point domain.) 

Introducing the transition operator defined as 

f = ij d''xe-"^'T{Q{x)q{x) , g(0)g(0)} (7) 

allows us a rigorous treatment of the spectrum since the spectrum in the inclusive 
decay is obtained from T in the following way: 

— = Im (HolflHo). (8) 

One applies the Wilson operator product expansion [0 (OPE) to express the non- 
local operator T through an infinite series of local operators with calculable coeffi- 
cients. Underlying this ansatz are certain assumptions concerning analyticity which 
establish duality between the quark-gluon and hadronic descriptions in eq. (||); for 
more details see ^, ||. 

To construct the OPE explicitly we consider the transition operator T first in the 
tree approximation (Fig. 3) where it takes the simple form : 

f = -j d'x{x\Q{X)-^^^^^Q{Xm (9) 



We have used here the Schwinger (background field) technique [13] and the cor- 
responding notations; i.e., the coordinate and momentum operators X and V are 
defined by 

V,=p, + A^{X), A^ = gT'^A;; (10) 

[p^X^] = ig,,^, [p^p^] = [Xf,X^] = 0, (11) 

while the states \x) are the eigenstates of X^, X^\x) = x^\x). 

It is obvious that if the quark Q considered is very heavy its momentum contains 
a large mechanical part, 

(Po)m = "^Q^M' (12) 

where is the four- velocity of the hadron Hq, = {pHQ)fj,/MHQ- In other words, 
in the x dependence of Q{x) the mechanical part can be singled out. 



Q{x) oc e^ 



-iPox 
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and, correspondingly, one can proceed from the operator to vr^ presenting the 
difference between and the mechanical part (-Po)/^, 

= (Po)m + (13) 

Assembling all these definitions it is not difficult to rewrite the transition operator 
(|^) in the form 

f=- I d^x{x\Q— ^— ^g|0), (14) 

which is particularly suitable for constructing OPE by expanding eq. (|14[) in the 
powers of vr. To zero-th order, when one neglects vr altogether, one gets 

^^°^ = -^<5(0)Q(0), (15) 

where 

k = Po-q. (16) 

Substituting this result in the general expression (||) one recovers the free quark 
prediction for dV/dE , eq. (|^), for uiq oo since {Hq\QQ\Hq) = 1 holds in that 
limit. 

All pre-asymptotic corrections in the matrix element < Hq\QQ\Hq > to order 
l/mg are reducible to the quantities A and {Hq\Q7i'^Q\Hq) []. To see that this is 

indeed the case let us first consider the heavy quark current, QiD^xQ, whose diagonal 
matrix element is fixed by conservation of current. In the rest frame of Hq we have: 

{Hq\QiD,Q\Hq) = 1 . (17) 



If we next use the decomposition (O) we arrive at the following relation 

^ {HQ\mQQQ + QtxqQ\Hq) = 



'^'^■{Hq\QQ\Hq) + _ {HqIQtt'QIHq), (18) 



where the second line is due to the equation of motion. Eq. (|TBp implies, in turn, 
that 

,^{Hq\QQ\Hq) = ^ fl - _ \^ {HqIQit'QIHq)] . (19) 
2Mhq 2mQ \ 2mQMHQ J 

The differential and inclusive widths are determined by the very same combination, 
{2Mhq)~^{Hq\QQ\Hq). Therefore, the results (|^), (|^) of the leading approxima- 
tion remain intact| at the level of l/mq and get corrected only at the level l/rriQ. 

^There is no analog of the chromomagnetic operator for scalar quarks. 

^The l/mg corrections at the intermediate stage appear due to relativistic normalization; in principle 
the calculations can be reformulated in such a way that they do not appear at all. 
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The l/mg correction in {2Mhq)~^ {Hq\QQ\Hq) is not the only source of the l/roQ 
corrections in the inclusive widths, however. 

Let us consider now terms 0(vr) and 0{ti'^) in the expansion of eq. (|14|). They are 



(20) 



(21) 



One should keep in mind that the operators in T^'^^ and T^'^^ are to be averaged 
over Hq, and we assume that this hadron is spinless or, if it has a non- vanishing spin, 
averaging over the spin is implied. Moreover, we will also use the equation of motion 
VToQ = (^V2mQ)g. Then 




(22) 



In f (2) we neglected the term oc vTq which is of the higher order in l/niq. 

Now, what corrections to the energy spectrum of 0, eq. (^, stem from the expan- 
sion of the transition operator we have just derived? Using the general relation (^) 
we readily obtain 



6'{E - ^) + ^6"{E - ^) 



(23) 



If in the leading approximation the spectrum was just a delta function, the corrections 
are even more singular! This is no surprise, of course. As was mentioned above the 
width of the line in the transition Hq — > (pXg is of order A. We expand in the 
powers of A/ttiq; hence we must expect the enhancement of the singularities in each 
successive order. It is clear that to describe the shape of the line one needs to sum 
up the infinite number of terms in this expansion. We will return to the issue of 
summation later on. Here let us notice that the corrections to integral quantities, 
such as the total width or the average energy, are properly given by integrating eq. 
(|23|). In particular, it is not difficult to see by explicit integration that the correction 
to the total width due to (|2^) vanishes. Then the difference between T and the free 
quark width Tq comes from the matrix element of QQ, see eq. (p!9D, 



r = rr 



2m% 



+ 



(24) 



The correction in the brackets has a transparent physical meaning, it reflects time 
dilation for the moving quark. In other words one can say that the l/rriQ in Tq is 
merely substituted by {1/ Eq) in the formula for the decay probability^. 

■^Only the kinetic energy enters here; if one identifies tt with usual momenta - what is assumed here - 
the Coulomb potential energy drops out and Eq differs from raq only by the kinetic energy. 
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As for the average energy, eq. (p3[) implies that 



iE-^) = ^{HQm'QlHQ). (25) 

Moreover, the dispersion is also calculable, 

{{E - = Y^JHqIQtt'QIHq). (26) 

The last equation shows that the line width has a natural hadronic size of order of 
the characteristic spatial momentum of the heavy quark inside the hadron (i.e. of 
order A). At the same time, the shift of the center of the line, eq. (pSj), is small - 
of order of (7f^)/mQ. To find higher moments of -E — (mQ/2) one should proceed to 
consideration of operators of higher dimension. 



The vanishing of the Qn^Q correction to the total width due to eq. (p3D can be 
simply understood on general grounds. Indeed, this operator is not Lorentz invariant 
and clearly can not appear by itself in the OPE for the total width which is the 
Lorentz scalar. As for eq. ([T9|) where the same operator enters we used there the 

mere substitution of Lorentz scalar QQ by a sum of two operators, QiDoQ and 
Qn'^Q] both are not Lorentz scalars. However, operators with non-zero Lorentz spin 
contribute to the energy distribution, their Lorentz indices are contracted with the 
4- momentum of the 4> particle. 

The first Lorentz scalar operator after QQ is the operator QQqq whose dimension 
is four - two units higher than that of QQ. The diagram giving rise to this operator 
is depicted in Fig. 4; it contains a hard gluon exchange , i.e. extra asimg) in the 
corresponding coefficient. The operator QQqq produces a correction oc asirriQ) /mq 
in the total width and in the spectrum which will not be considered in this work. 

The remark above concludes a summary of those results which have been already 
discussed previously in other contexts, see - 0- We proceed now to the issue 
of the line shape. As was mentioned above, the question of how the delta function 
line of the parton approximation is smeared into a physical finite size line requires 
summation of all orders in vr in the expansion of eq. (p!^. The formal series stemming 
from (|Tip is 

To evaluate the relative roles of different terms in eq. ( P7| ) one should keep in mind 
that the characteristic values of the momentum k = Pq — q'm the domain of interest 
are 

/co ~ ~ mQ/2, rriQA. 

If the first estimate is quite obvious, the second one, probably, calls for a comment. 
Kinematically —k"^ = 2mQ{E — (mQ/2)) and inside the line \E — {mQ/2)\ ~ A. 
In particular, on the positive side the maximal accessible value of E — {mQ/2) is 
(M^^-mQ)/2 = A/2. 

It is clear then that the vr^ terms in eq. (^) can be systematically neglected 
compared to 2kTT. Indeed, all terms (2fc7r/A;^)" are of the order of unity in the domain 
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under consideration. At the same time, insertion of every additional vr^/Zc^ factor 
leads to a suppression of order K/itiq. These insertions become important only at 
the subleading level, a question that will not be touched upon here. 

Thus, keeping only those terms in the series that do not vanish in the limit niq 
oo we get the following expansion for T: 

1 CO , Ox" 

^ = E (-p) (Qvr^,...vr^„Q - traces) . (28) 

This expansion is very close to the expression quite standard in the OPE analysis 
of deep inelastic scattering. The terms kept are analogs of the twist two operators 
appearing in the latter case; those with vr^ which we neglected represent higher-twist 
effects. Only symmetric traceless operators of the leading twist are relevant to the 
approximation accepted in this work. In other words, the theory of the line shape 
in the heavy quark decays developed here is similar to the theory of the structure 
functions in deep inelastic scattering with the power 1/Q corrections discarded. We 
will be returning to this parallel with deep inelastic scattering more than once. 

After the transition operator T is built the next step is the averaging of eq. (P^ ) 
over the hadronic state Hq. Using only the general arguments of Lorentz covariance 
one can write 

{EQ\Q^[^,^...^[^,^Q - traceslifg) = a„A"(t;^j...t;^,^ - traces). (29) 

Notice that we single out the dimensionful factor A™, a natural scale for (vr"), so that 
all coefficients are dimensionless numbers of order one (if n is not parametrically 
large); in principle any other physical scale of order A can be used instead. With 
these definitions we readily arrive at 




^^Q\n^Q) = -TT.Y.'^A-iTrr-A ; (30) 



all trace terms are omitted here since they obviously convolute fc's with each other, 
and each /c^ in the numerator brings in a relative suppression oc K/mq. 

A consequence that immediately stems from this result is the existence of a "scal- 
ing" variable. The line shape is determined by a function depending on the energy 
only in the "scaling" combination 

- = -4^ = t(^-— )• (31) 

In the physical domain a formal interval of variation of the variable x is {—mq/K) < 
X < 1, however to establish the duality relations one as usual has to consider eq. (|30|) 
in the complex plane. Of course, it is clear that on the negative side the distribution 
dies off steeply at x < — xq where xq is a positive number of order one. It is worth 
emphasizing that xq needs not be one, and its value is set dynamically, in distinction 
with deep inelastic scattering where the limits of variation of the Bjorken variable are 
established kinematically, < x^j < 1- 
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We now can continue our parallel with deep inelastic scattering further. Let 
us show that the coefficients a„ are nothing else but the moments of a universal 
distribution function F{x), the function determining the line shape. Namely, 

dxx'^F{x), n = 0,l,.... (32) 

-oo 

We will see in a moment that on general grounds F{x) must be obviously positive 
and indeed its support lies below 1 as is stated by eq. (^); moreover F must fall off 
exponentially towards negative infinity, so that practically the integration interval is 
limited by —Xq from below. 

We need to emphasize at this point that the distribution function F{x) introduced 
so far describes the "primordial" heavy quark motion inside the hadron and does not 
contain the effect of the hard gluon emission in the course of the Q ^ q + cj) transition. 
This emission can create a light hadronic system of arbitrary invariant mass in the 
final state, which would correspond to a long q;s/(1 — x) tail in the line shape at the 
negative values of x. This perturbative tail should be considered separately, of course. 

Substituting eqs. (B^) and (|30D in eq. (H) we arrive at 



2 



dE 71 Mhq J ' ' k"^ + 2yAkv + ie 



=ToF{x) 1 + 0{ — ) (33) 
A V J 

where x is defined in eq. (pT]). We see that the primordial heavy quark motion does 
indeed smear the parton delta function in TQ^dT /dE: instead of the delta function 
we have a peak centered at ^«q/2 whose width is ~ A and height is ~ 1/A (see Figs. 
1, 2). Notice that the normalization condition 

/rfB§ = r. (34) 

taking place in our approximation is automatically satisfied since 

J dxF{x) = ao = {Hq\QQ\Hq) = 1. (35) 
For a few higher moments it is not difficult to get 

J dxxF(x) = ai = 0, (36) 

/ dxx^F{x) = a2 = -^{HqIQtt^QIHq), (37) 
3A 

/ dxx^F{x) = as = -^{Hq\Q{D^GMHq). (38) 

6A 

The gluon operator -D^G^o in eq. (0) emerges from the commutator [7r^[7r^7ro]] and 
reduces to the light quark current by virtue of the equations of motion. The occurrence 
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of this operator, not related to powers of the spatial momentum tt, imphes that the 
distribution function F{x) can not be interpreted as a non-relativistic heavy quark 
wave function in the momentum space. 

Summarizing, we presented the procedure of introduction of the universal distri- 
bution function describing the heavy quark motion, so far in a simplified environment 
of our toy example. The procedure is quite similar to what is usually done in deep 
inelastic scattering, with certain peculiarities, though. The non-perturbative param- 
eter A plays here a role analogous to the nucleon mass in deep inelastic scattering 
(although we will see that it is literally so only as long as the effect of perturbative 
corrections is neglected). As in the latter case the "primordial" function F{x) is not 
calculable in perturbative QCD. At the same time, the renormalization of F{x) due to 
gluon exchanges is in principle calculable although at this point the straightforward 
similarity with DIS does not hold. 

Concluding this section it is instructive to make a few important remarks of a 
general nature. First, in spite of the non-relativistic nature of the heavy quark motion 
inside Hq the genuine distribution function F{x) is in no way equivalent to the non- 
relativistic (momentum-space) wave function of the heavy quark; rather it is given (for 
light final quarks) by the distribution over the light cone combination of momentum 
components. The distinction, as we discussed above, see eq. (|38|), becomes apparent 
already for the third moment. In other words, all non-relativistic models (say, the 
AC^M^ model and other models of this type) are doomed to be incompatible with 
the genuine QCD-based picture of the heavy quark motion, although numerically in 
some parts of the spectrum the models can come close to what is expected from 
QCD. (This happens, for instance, with the shape of the spectrum in the inclusive 
semi-leptonic decays, see ref. ||^). One should not expect from these models to 
reproduce all and every details of the QCD-based predictions [Q. 

Second, the theory developed is not applicable in the tails of the distribution. 
One of the tails, at negative values of x, has been already mentioned. The hard 
gluon emission in the transition Q — > g + populates the domain x < —xq in the 
spectrum at the level a.^/ {^—x) which is much higher than the primordial heavy quark 
motion. (For more details see Sect. 5). Thus, the negative tail of the "primordial" 
F{x) is simply buried in the intensive perturbative background of Q ^ g + gluon + 0. 
Another change of regimes happens at positive values of x, very close to the end-point, 

2 

E = Mhq/^. When E — {Mhq/2) ~ A /mq the invariant mass of the final hadronic 
state produced in the transition is of order A^, not Amg. Formally the operator 
expansion for the transition operator T we used is not applicable in the end point 
region if one aims at such high energy resolution because the corresponding operator 
enters then in the complex plane too close to the singular point. Physically it means 
that this small end-point interval of the spectrum belongs to the resonance domain, 
and the inclusive approach we have used for developing our theory is inapplicable. All 
probabilities in this small end-point interval are definitely suppressed by additional 
inverse powers of mg. For very large itlq one could try [|15| to use the form-factor 



type approaches |]T6| for a quantitative description that accounts for the exchanges 
of semi-hard gluons. For practical purposes of treating the b quark decays these 
approaches do not seem to be useful, however, since the b quark is presumably too 
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light to justify the use of the asymptotic form-factor methods. 

3 Real QCD: B ^Xg and Semi-Leptonic Decays of Beauty 

In this section the analysis carried out above in the toy example will be extended to 
the actual processes one encounters with in the genuine QCD, namely the radiative 
transitions of the type B •yXs and the semi-leptonic inclusive decays of the type 
B IviXu- For simplicity in this section we drop our generic notations Q and Hq 
and speak about h quarks and B mesons. The results will be equally applicable to 
any heavy quark Q and any Q containing hadron, meson or baryon. In those cases 
when the spin of the decaying heavy hadron is non-zero the averaging over spin is 
implied. 

We start from a purely kinematic question of introduction of the structure func- 
tions. Quite obviously, the transition operator T now becomes a Lorentz tensor, 

f,,=ij rf^xe-^^^T{jt (x)jUO)} (39) 
where the current depends on the process considered. For the semi-leptonic decays 

j^ = M7;.(l-75)& (40) 
while for the radiative transitions (neglecting the s quark mass) 

j^ = 29,[s(l + 75)a^.6]. (41) 

Averaging T^^, over the unpolarized beauty state one gets the forward scattering 
amplitude h^y 

- hi + h2Vf,Vu-i h ^^uap v^q^ + q^ + hr, {q^ + q^ v^), (42) 

where the hadronic functions hi depend on two variables, qv and q"^ and eoi23 = — 1- 
The measurable inclusive rates are expressible through the imaginary parts of hi, 

Wi = 2lmhi. (43) 

In full analogy with deep inelastic scattering we will refer to Wi{qv,q'^) as to the 
structure functions. In the most general case there are five structure functions (not 
six, as indicated in @). They are, clearly, different for the semi-leptonic and the 
radiative transitions. Moreover, the generic decomposition of eq. (|^) significantly 
simplifies in the case of the radiative transitions because of the transversality of the 
current (^1]) and due to the fact that here = 0. Concretely, 

-(lisc{h^y)B-.-tX, = gi{-gf,u{qv) + q^v^, + quV^){qv) - ig2e^,u'y&v"' q\qv) (44) 
where gi and g2 are functions of qv. 
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To complete the kinematical analysis let us give the expressions for the differential 
rates (inclusive with respect to the final hadronic states). In the semi-leptonic decays 



dEi dq^dqo 

I Kb I [2gV + [AEi{q^ -El)- q^] W2 + 2 q\2 Ei - q^) w^]. (45) 

while in the radiative transitions 

dV 

-(B^,X,) = ^^,W (46) 

Here go = Qv, Vub is the CKM matrix element, Gi? is the Fermi constant and finally 
the coupling constant A parametrizes the strength of the 6 — ^ 57 transition 

i _ A^ 
C\b_,s-y = -\E^^s{l - j5)(^fiub, To{b S7) = —ml. (47) 

It is worth noting that W4 and do not enter in the semi-leptonic rate due to 
the fact that we neglected the charged lepton mass. Moreover, in the spectrum of the 
radiative transitions the function (72 drops out; to measure (72 it is necessary to detect 
the difference in the number of the left-handed and right-handed circular polarized 7 
quanta, 

= il. (48) 

We proceed further in close analogy to our toy example. There are some distinc- 
tions, however, which deserve mentioning right now. First, we deal now with several 
structure functions: Wi {i = 1,...,5) in the semi-leptonic decays and gi {i = 1,2) in 
the radiative transition. All of them will be expressed in terms one and the same 
distribution function F{x) of the heavy quark b in the B meson. Thus, F{x) is a 
universal distribution in the same sense as the light quark distribution functions in 
deep inelastic scattering. 

The second point, also quite similar to deep inelastic scattering, is the scaling 
feature. Namely, the variable x has the form 

^ _ T^Q + (f - 2mQ{qv) 

^ ~ 2Akv ~ 2A{mQ - qv) ' ^ ' 

k,, = mgv^ - q^ 

and all structure functions Wi which, in general, depend on two variables, fc^ and kv, 
are actually expressible in terms of the function F{x) depending on the single variable, 
the ratio k'^/kv. Of course, in the case of the radiative transition two independent 
variables kinematically degenerate into one. 

The scaling under discussion is an exact analog of the Bjorken scaling in deep 
inelastic scattering and, likewise, is broken by perturbative loop corrections (which 
are calculable) and by non-perturbative power corrections (higher twists). 

^The strange quark mass is neglected. 
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It is already quite clear from Sect. 2 that our formalism is based on the consider- 
ation of the transition operator given, in the case at hand, by eq. (|39|). In the tree 
approximation corresponding to the graph of Fig. 3 we substitute the current 
generating the weak semi-leptonic decay in eq. (Wh and get 



f,u = -2 / d'x{x\hf.j^lu{l - ln)b\0) (50) 

where the Schwinger technique is used again. After simple algebra the expression 
(|50|) is transformed into 

In the leading approximation one can neglect /f compared to in the numerator 
and TT^ + {i/2)afj,i^G^u in the denominator. Moreover, the product of three 7 matrices 
reduces to 7q, and 70-75. The operators with 70,75 drop out upon averaging over the 
unpolarized hadronic state, while 7^ can be substituted by Va- Assembling all these 
elements together we arrive at 

V = 7Tir?~(^I^M!^l^) = {-9fMu{kv) + k^Vy + k^v^ - ie^^apk'^v^jx 

As explained in Sect. 2 only the leading twist operators will contribute in our ap- 
proximation. Their matrix elements are defined as 

{B\ S hTx^^...-n^J)- traces \B) = ajC'{Vf,^...v^„ - traces). (53) 



2M^ 

where S is the symmetrization symbol and A = Mb — mi,. 

Next, we introduce the distribution function F{x) via its moments, 

an = J (Ixx'^Fix), n = 0,l,... (54) 

The lower limit of integration is to be specified dynamically, the upper limit is 1 
(cf. eq. (P^D). For the first three moments of F{x) we have expressions perfectly 
analogous to those quoted in eqs. 



J dxxF(x) = «! = 0, (55) 

/ dxx'^F{x) =a2 = -^{2MB)-\B\bn%\B), (56) 
3A 

/ dxx^F{x) = aa = -^(2Mb)~^(S|6(D^G^o)&|5). (57) 
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Calculation of the discontinuity of /i^i, is straightforward and leads to the following 
structure functions: 

wi = Y^W' 

W2 = - — Wi, W3 = —Wi, W4 = 0, W5 = -—Wi. (58) 
kv kv kv 

Notice that all these expressions are transparent generalizations of the formulae re- 
ferring to the free quark decay. The latter case, the free quark decay, is recovered by 
the substitution 

F{x) 6{x). 

Absolutely similar procedure yields the following results for the functions gi in the 
radiative decay: 

91 = 92 = Y^(^) (59) 

where now 

X = jiqo - ^) (60) 

(cf. eq. i^). 

After the structure functions are found mere substitutions yield the inclusive dis- 
tributions sought for, 

= T,{h^s^)=F{x), (61) 

dEM'dq, = ^ K^^;^ ^''^ 

where 

T,{b^W,u) = K,\^^ (63) 



and the argument x of F{x) is defined by eq. ( pOD for B ^Xg and by eq. (|49|) for 
B ^ lui Xu. 

Notice the scaling feature of the differential distribution (|62D, the structure func- 
tions Wi{qQ,q^) of two variables are expressed via a function of one variable x = 
(2m{,go ~ ^1 ~ 'f) I ('^fe ~ This statement is a full analog of the Bjorken scaling in 
DIS. The parallel goes further, of course. Like in DIS, perturbative gluons will lead 
to a breaking of the scaling law. The perturbative violations of scaling are calculable; 
some aspects of the radiative gluon corrections will be discussed in the next section. 

We pause here to warn the reader: one should be cautious in applying eq. ( |62D in 
the entire allowed kinematic domain. It is not valid in certain boundary subdomain. 
For instance, for the maximal value of the g^, 

the differential rate turns out to be negative. This happens because our approximation 
fails in the domain where the invariant mass squared of the hadronic state produced 
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is of order A , not Am;,. The error due to the inaccuracy of eq. ( |6^ ) in the boundary 
subdomain shows up in the integral quantities, like, say, the total decay rate, etc., at 
the level of A /mf. In the present paper we do not discuss these quadratic effects. 
Integrating eq. (|6^) we get the result for the lepton energy spectrum in the form 

^ =To / _dxF{x) M?>mb-AEi + Kx). 64 

dEi J(2E,-mb)/A ml 

The term Ax in the brackets is of a kinematical origin. Corrections to the spectrum of 
the same order, 0{A/mb), are generated also dynamically, via higher twist operators 
in OPE which appear if one accounts for the terms / and vr^ + {i/2)aG in eq. (|5lD. 
In the above analysis these terms were neglected in the expansion. The higher twist 
effects can be treated essentially in the same manner it is usually done in DIS, see 

e.g. m. 



If the energy Ei is not too close to its upper limit eq. (0) reduces to the spectrum 
of the free quark decay. Indeed, when {■mb — 2Ei) >> A, the lower limit of integration 
in eq. ( |5^ becomes a large (negative) number, and the integrals over x can be done 
explicitly ( see e.g. eq. (pSj)) 



/ dxF(x) = 1, / dxxF(x) = 0. 

J—oo J —oo 



Of most interest is the range of energies in the window, 

<2Ei-mb<A (65) 



and nearby. For such energies the lower limit in the integral ( p^ becomes of order 
1, and the end point shape of the lepton spectrum is expressed via integrals over the 
distribution function F{x). Although this function is unknown, it is the same one 
that determines the inclusive radiative transitions. 



4 Including the Mass of the Final Quark 

Here we address the question what happens if the final quark mass is non-vanishing. 
In general, the primordial motion of the initial heavy quark will now manifest itself 
via different distribution functions. This shows again that non- relativist ic models of 
the AC^M^ type are too gross to reproduce all subtle features stemming from QCD. 
We plan to discuss the issue in more detail elsewhere. Here, just to illustrate our 
point, we will consider the limit of the non-relativistic final quark and again resort to 
the toy example of Sect. 2. The quark spins play no role in this limit, of course, and 
can be ignored. 

If the initial quark mass is mg and that of the final quark is nig we assume that 

Am = rriQ — uiq « ttlq. (66) 

Of course, the validity of the inclusive description we consistently use requires simul- 
taneously that 

Am » A. (67) 
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This is the so-called small velocity (SV) limit |T^. In this limit the velocity v of the 
final quark in the transition Q g0 is 

\v\ = Am/rriq. 

Notice that Am coincides with Mhq — M^^, and the physical velocity of the final 
hadron is the same as above. 

In the SV limit the (p spectrum is peculiar. At the free quark level it consists of a 
single monochromatic line, similar to eq. (|^), 

— = ToSiE - Eo), Eo = Am = Mh^ - Mh,. 

If we trace only terms 0{v^) the physical spectrum is exactly the same - the delta 
function peak residing at the same place - and the inclusive probability is completely 
saturated by one heavy meson in the final state [|l^. Notice that there is no 'window' 



- no shift is present between the maximal allowed energies of at the quark and the 
hadron levels in the case at hand. 

Modifications of this perfect quark-hadron duality start at the level of Oiy"^) |T9|| . 
If 0{v'^) effects are considered the height of the elastic peak is changed, and a comb of 
inelastic peaks appears, the height of the latter being proportional to v"^. This comb 
will lie aX E < Eq and will be stretched over an interval of energies of order A. (As 
was mentioned above, A is not a relevant parameter in this problem; we can continue 
to use it, however, just as a typical hadronic scale. One could certainly choose another 
definition of the typical hadronic scale.) The integral over the inelastic peaks must 
compensate the distortion of the elastic one - a variant of the Bjorken sum rule [ p!9|] 
taking place in our toy example. What we would like to do is to relate the inelastic 
spectrum (the comb of resonances in the final state) to a new distribution function 
connecting this spectrum to the motion of the initial heavy quark inside Hq. 

To this end we again turn to consideration of the transition operator ([^). Our 



analysis is changed in a minimal way - eq. (|27|) is substituted by 



where q is the momentum of 0. The SV limit means that q is to be treated as a small 
parameter, and we will expand in q keeping the terms up to the second order. In the 
zero-th order in q the equation of motion 

immediately tells us that the corrections to the free quark result are absent (up to an 
overall normalization), 

{HQ\f^')\HQ) = -J-—{Hq\QQ\Hq) = 
rriq - 
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M 



(l + 0{A'/ml\ 



In the first and second order in q we get 

{HQ\f'''\HQ) - 



{Hq\T^'^\Hq) = 



2mQ 



E 



{niTiQiTi). 



(69) 



(70) 



(71) 



To compress our formulae liere and below we resort to a short-hand notation, for 
instance, 

What follows just parallels all steps leading to the introduction of the function 
F{x). We introduce now a new distribution function G{x) whose moments bn are 
realted to the matrix elements of the operators in eq. ([7I|), 



(TTiTToTTj) = bnA 

dxx'^G{x) = h 



+2 



(72) 
(73) 

It is evident that G{x) can be called a 'temporal' distribution function, unlike F{x) 
which may be called the 'light-cone' distribution function. 

Notice that Bq = 3a2 and is related to (vr^) while bi = 803 and is related to 
{{DiEi)) where Ei is the chromoelectric field. For bn with n >2 the operators HiTTQiTi 
are expressible in terms of the chromoelectric field and its time derivatives. 



(vr.<7r,)=r-^(E,(Do"i^,)). 



(74) 



Substituting eqs. ([72D, ([73|) in eqs. (|69| ) - (^) we obtain for the imaginary part 
of the transition operator. 



Im {2MHj~'{HQ\f\HQ) 



TT 



5'{x) 



'go(5^) , 1 <f 
2m'> 



5{x) 



3 im?r 



dyy-'-'Giy) + 



1^ 

3 m, 



2 X ^G'(x) 



(75) 



where the variable x is 



a; 



2mQA 



E - Eq 



m„ 



2mr 



The first line in eq. (75) describes the elastic peak renormalized by (f corrections. 
The second line is a small shift in the position of the elastic peak. Finally, the third 
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line represents the inelastic contribution which, as we see, is proportional to (f in full 
accord with the general expectations [|1^ . 

If we now combine this imaginary part with the general expression (j^) we arrive 
at the physical spectrum dV/dE in the Hq (pXq decay. In principle, eq. ([73| ) has 
a wider range of validity since it is applicable for 7^ as well. In the case = 
one must substitute 

q^ = \q\ = E = Eo + Ax. 

Different terms in eq. ([7^) are of the different order in A/tuq. In the leading order 
in A/itlq the second line must be omitted and (f in the first and the third lines must 
be replaced by Eq. Moreover, the overall factor E relating eq. (|75|) to the spectrum 
(H) must be replaced by Eq as well. 

If one calculates now the total decay width F by integrating over the (j) energy 
the terms proportional to (f in the first and the third lines cancel each other and the 
total width obtained in this way coincides with the free quark one. The suppression 
of the elastic peak is exactly compensated by the integral over the inelastic part of 
the spectrum. Non-renormalization of the total width in the order ^) is nothing 



else than the Bjorken sum rule 



This assertion of the non-renormalization of the total width should, of course, sur- 
vive inclusion of the A/ttiq corrections simply because these corrections, as we know 
P, , are absent in the total width. It is instructive to trace how this cancellation is 
arranged. 

It is a rather straightforward exercise to keep corrections of the first order in A/ tuq 
in the spectrum, and then to check that the cancellation persists, and the Bjorken 
sum rule still holds. Now we can not use the substitution E Eq, of course; instead 
we must use E = Eq + Ax, and the second line in eq. ( |75D must also be taken into 
account. One should also keep in mind that the function G{x) itself was defined 
above only in the leading order. We can generalize the definition, however, to include 
corrections A/mq in a corrected definition of G{x). What is important is that the 



same new (corrected) function will appear in all lines in eq. (1751) , which will guarantee 
the desired cancellation and the validity of the non-renormalization theorem [^S], p!9t| . 
Let us now discuss in brief the general case of the arbitrary ratio 

_ rng_ 

The physical boundary between what can be called here the light final quark and the 
heavy one lies at 

~ AniQ. (76) 

If rrig << (Amg)^/^ then the effects due to rUg are small. On the other hand, to the 
right of the boundary ( [75| ) these effects become of order one, and the question arises 
as to how one can generalize the analysis above. 

The generic distribution function for the arbitrary mass ratio 7 is introduced 
through the matrix elements of the operators 

^-^{HQ\Q{u7rrQ\HQ) = J dxx^F,{x), (77) 



2MhqA 
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where the four-vector v is defined as 



= — ^, 1/ = 47 . 

It is not difficult to check that for 7 = we return back to the definition of the 



'hght-cone' distribution function, and F„ coinsides with F, eqs. (|53|), (15^). In the 
opposite limit 7 — 1 the function Fi, is proportional to (1 — 7)^6* where G is the 
'temporal' distribution function. 

Thus, although the heavy quark distribution functions introduced are universal 
for each given value of 7, for different 7's we have to deal with different distributions. 
This is another manifestation of the fact that the naive models of the AC^M^ type 
are intrinsically incompatible with QCD. 

We pause here to make several remarks. Notice that at 7 7^ and ^ the 
scaling law (the dependence on a single variable, eq. (^91) ) does not hold any more. 
We do not know what replaces it. 

Our next observation concerns the size of the window. It is quite obvious that the 
window shrinks as 7 approaches 1, as (1 — 7)^ A. In other words, 

F^{x) = at x > i(l -7)^A. 

This property implies a specific behavior of the matrix elements (^) at large n. It is 
clear that if 7 is close to 1, when the window is essentially absent, the introduction 
of the parameter A is purely artificial. Any dimensionful parameter of the typical 
hadronic size could have been used for parametrization of the matrix elements ([TTl). 

Needless to say that the analysis carried out above is trivially generalizable to 
the case of real QCD. The strange (and the more so u and d) quarks produced in 
the b decays can be treated as light (massless). As for the c quark, it lies close to 
the boundary (|76D, ~2 GeV^ and Arub ~ 2 GeV^. Thus, formally it does not 
belong to either of the limits. Still arguments exist showing that the non-relativistic 
description of the c quark in the inclusive b decays is not so far from reality. 



5 Radiative Corrections 

The above discussion of the end-point behavior of the spectra refers only to purely 
nonperturbative effects. In reality the perturbative corrections are also present. As we 
know from experience with DIS the perturbative corrections are not small because 
of the presence of large logarithms and lead to violation of the scaling behavior. 
We need an analog of the Lipatov-Altarelli-Parisi evolution kernel to account for 
the perturbative effects. The primordial distribution will be convoluted with this 
evolution kernel. 

Let us first sketch a qualitative picture using the example of -B ^ 7-^s- In the free 
quark approximation the photon spectrum is a monochromatic line (the delta function 
in eq. (|^)). The primordial spread of the heavy quark momentum substitutes the 
monochromatic line by a finite width line. We account for the primordial spread 
by convoluting the primordial momentum distribution with the delta function kernel 
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(see eq. (|33|)). The width of the hne becomes of order A. Outside this end-point 
domain whose size is ~ A the primordial motion has a neghgible impact on the 
photon spectrum. 

As a matter of fact, even without the primordial spread, the perturbative gluon 
emission smears the delta function of eq. (|), producing quite a drastic effect both in 
the end-point domain and, even more so, outside this domain. First, a radiative tail 
is generated below E = mf,/2. If (mfe/2) — ~ [mf,/2) the radiative tail is 0{as)', it 
is further enhanced as one approaches the end-point domain. The end-point peak in 
the spectrum looses its delta function shape, but still a certain singular behavior at 
E = mb/2 persists. The shapes of the radiatively distorted peak and the adjacent tail 
are determined by the Sudakov exponent. The total probability is redistributed: the 
area under the peak in the end-point domain (which used to be one in appropriate 
units in the Born approximation) is now ~ {A/mi,y° where eo is a positive number, 
and the total probability is saturated in the tail. Clearly, the perturbative gluon 
corrections do not generate the spectrum in the window mb/2 < E < Mb/2. To 
describe the effect of the filling of the window one has to include the primordial 
momentum distribution of the heavy quarks which is now to be convoluted with a 
new kernel, created by the gluon radiative corrections. 

The distortion of the delta function (H) described above is physically quite trans- 
parent. The total area under the distorted curve is unchanged, however. The picture 
must be very familiar to those readers who remember the treatment of the electro- 
magnetic radiative corrections in the J/ip peak in e~^e~ annihilation There, the 



natural width of the J/ip meson is also negligibly small, and the observed shape of 
the peak is totally determined by two effects: the radiative smearing of the original 
Breit-Wigner J/ip peak and the energy spread in the colliding beams. These two 
effects are convoluted. 

We proceed now to a more quantitative discussion of the issue concentrating on 
the example of -B — ^ 'jXg. The conclusions will be of a more general nature, of course. 
The convolution discussed above can be written in the following way: 

"^-^^'> - f iyF(y)^!EnEfmyl (78) 



dE J ' ' dE 

where dTB{E)/dE denotes the observable (physical) 7 spectrum while dT^'^^^/dE 
refers to the photon spectrum in the b s + 7+ gluons transition in perturbation 
theory. If dV^^^^/ dE is substituted by the delta function distribution (||) we recover the 
old result (|33|). On the other hand, if the perturbative smearing produces dV^^^^/dE 
that is much broader (as function of x) than F{x) then the primordial spread F{x) 
does not affect the physical spectrum drB{E)/dE, and the latter coincides with the 
quark-gluon one unless one is interested in small corrections. 

To find what situation is actually realized we need to explicitly calculate dV^^^^/ dE. 
As we will see shortly, the shape of the peak characteristic to the primordial momen- 
tum spread F{x) is recovered in the end-point domain. The peak in the physical 
spectrum near E = mb/2 persists, although its height is scaled down significantly. 

To begin with let us consider the one gluon emission. If one limits oneself to the 
double-logarithmic approximation (DLA) the result can be borrowed from text-books; 
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namely one obtains 



dE 



2mh 



71 



Tolm 



(79) 



where Fq is given by eq. (^71) , k"^ = {mhV — qy = 2mb{E — {mj, / 2)) , and is the gluon 



couphng constant. We are mostly interested in the end-point domain {E — (mfc/2)) 
A, 



I.e. 



k'^ ~ mfeA; 



(80) 



m 



then the logarithm of ml / k"^ can and must be treated as a large parameter. Eq. 
keeps only the double-logarithmic terms, those with one log less are omitted. 

A subtle point which deserves mentioning in connection with eq. ( |79D is an "en- 
hanced" singularity at fc^ = 0. For calculating the imaginary part of eq. (|79| ) per se at 
k"^ = one should, in principle, regularize the logarithm in the infrared region. There 
is no need in any explicit regularization, however, since the summation of the double 
logarithms provides us with a natural regularization - the point fc^ = is absolutely 
suppressed after the summation. 

Collecting all double logarithms in the standard manner we get the well-known 
Sudakov exponent. 



dE 



2mh 



TT 



Folm 



1 



-k' 



exp 



2a. 



37r 



In^ 



mt 



-P) 



^1) 



The fact that the gluon coupling as runs is unimportant in deriving eq. ( |8TD since 
this expression corresponds to DLA, while the running nature of as becomes visible 
only at the level of subleading logs. With the double log accuracy a. normalized 
at any point is equally suitable in eq. (^). We can try to guess, however, what 
would be the effect of subleading logs on on physical grounds, without carrying 
out a complete and consistent analysis (for a dedicated discussion see ref. |^T|). If k"^ 
provides an infrared cut-off for all transverse momenta in the problem then the largest 



possible value of a. is as{k ). Then substituting a. — > as{k ) in eq. ( pl|) we only 
overestimate the Sudakov suppression, and actually its influence will be milder than 
the estimates following below show. We will anyway use the prescription as — > as(fc^) 
for a simple-minded modelling of the Sudakov effect. Of course, this aspect can be 
improved. Some trivial improvement will be mentioned shortly. 
The validity of DLA implies that the condition 



as ml 



« 1 



must be met. This condition is numerically satisfied provided that k"^ is chosen in the 
interval (POj), as we will see below. 

Next we observe that the Sudakov exponent can be identically rewritten as a power 

of 



exp 



2a,(A;2 
37" 



In^ 



ml 



-k^)_ 



-k' 



mt 



12) 
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where the exponent e is given by the following expression: 



provided that the prescription «s(fc^) is indeed valid. If k'^ = —nihAx then, 

obviously, 

^ _ _8_ ln(mj,/A) -\nx 

^ ~ 3b ln(mb/A) + 2 ln(A/A) + In a; ^ ^ 

where b is the first coefficient in the Gell-Mann-Low function and A is the scale pa- 
rameter of QCD. For x of order one and large ln(mfe/A) this expression approximately 
reduces to a constant, to be denoted below by eo, 

eo = ^ ~ 0.3. (84) 

The numerical value of eo turns out to be rather small, a fact crucial for our consid- 
eration. 

Let us parenthetically note the following. If, instead of as{k'^) as an overall factor 
in the Sudakov exponent, we kept the running inside the integrals determining 
the double logarithms the only modification of the results (within the accepted ap- 
proximations) would be a shift of eo towards a lower value. This is physically quite 
transparent since the Sudakov exponent becomes less suppressive. Furthermore, sub- 
leading terms given in text-books also work in the same direction: they enhance the 
pre-exponential factor tending to compensate in part the exponential suppression. 

Summarizing we can say that the net effect of the Sudakov suppression reduces 
to the substitution ^ ^ 

P ^ (P)l-eo(^2).o- 

Correspondingly, the radiatively corrected spectrum takes the form 

Idrr 2 sinvreo ^86) 



Fo dE Txnib (1 - (2E/mb))'~'" ' 2 

It is not difficult to see that this expression is actually valid as long as \E — {mi)/2)\ <^ 
(mf,/2), i.e. we are not necessarily confined to the end-point domain, eq. (|80D . 
Moreover, for qualitative purposes one can extrapolate eq. (^) even further down, 
to (mb/2) — E {mb/2). One sees then that the photon spectrum is dramatically 
reshuffied by the gluon radiative corrections: the delta function at mfc/2 is substituted 
by a much milder peak at this point, plus the tail which saturates the probability. 

To substantiate the point let us check what contribution to F comes from the 
end-point domain by integrating eq. (|86D from E = mh/2 to E = {mi)/2) — A, 

1 ,e'41J^'\ (87) 

Fo J{mt/2)-A dE \mb J 

In other words, the end-point contribution is suppressed as a (relatively small) power 
of mr^. 



23 



We pause here to investigate in more detail what happens with the shape of the 
physical spectrum in the end-point domain when both effects - the perturbative gluon 
corrections and the primordial heavy quark motion - are taken into account. Recall 
that only the latter effect populates the spectrum to the right of the point E = mb/2. 
Using the general expression ([7^) and the double log result ( P^ ) for dTl^^^/dE we 
obtain 

--^= — / dzF{x + z)-{z^'^) (88) 
1 "a; \mi, J Jo az 

where sinvreo is approximated by Treo and 



Now, the combination of three essential factors - (i) the fast fall off of the primor- 
dial distribution F{y) at large y, (ii) the specific form of the perturbative kernel (p6| ) 
and (iii) the numerical smallness of eo - leads to the fact that the convolution (p8| ) 
turns out to be approximately local if x lies in the end-point domain, 

^AVVm, ,89) 



Tq dx yrrib J 

We hasten to add that the locality in z in eq. (^) is an approximation valid as long 
as eo can be considered as a small number. The size of the domain over which F{y) 
is smeared by the convolution (^) is of order 

Azchar ~ exp(-c/eo) (90) 

where the constant c is of order one. For eo ~ 0.3 we have Azchar ^0.1 which sets the 
resolution with which one can extract the primordial distribution from the measured 
photon spectrum in the end-point domain. 



A remark is in order here explaining how the estimate ([90| ) is obtained. Let us 



take a model for the function F{x) in the range < a; < 1 in the form 

F{x) = exp{—x/a) 



where a parametrizes the width of our model function. Then the convolution (|88D of 
this function with the perturbative kernel yields 

g-x/a ^zQ-z/ad^,^ _ e-^/'^a^«r(l + eo) 
Jo dz 

~e-"/"[l-eo ln(l/a)] 

provided that a « 1 — x ~ 1. We also expanded in eo and retain only the term 
with the potentially large logarithm. If eoln(l/a) << 1 the convolution we made 
reproduces the original function F{x). At very small values of a where eo ln(l/a) ~ 1 
the Sudakov kernel smearing starts working in full. In this way we arrive at eq. (|90D . 

Let us try to summarize our findings concerning the role of the perturbative gluon 
emission in the transitions like B —>■ 7X3. The gluon radiative corrections affect 
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strongly the height of the end-point peak (for the actual b quark this height is sup- 
pressed by a factor ~ 2.5 compared to the result ( |6TD containing no perturbative 
corrections). The theoretical uncertainty of order O{eo) in the overall factor is due 
to subleading logarithms which we neglected. At the same time, the shape of the 
end-point peak is affected to a much lesser extent. The primordial F{x) is smeared, 
of course, but the width of the smearing is still much less than unity, see eq. (PD]). 
(In the B — >■ 7X5 transition the size of the smearing interval is ~ 30 MeV.) Since 
the width of F{x) is about unity the above smearing does not change the shape 
drastically. 

As for the semi-leptonic decays the triple differential distribution ( |62|) is affected 
by perturbative gluons in the way very similar to that just discussed. Correspond- 
ingly, the energy spectrum dT{B — > WiXu)/dEi is changed: the integrand in eq. (|6^) 
acquires an extra factor due to the radiative corrections. Observe, that the extra in- 
tegration in dr{B IviXu) / dEi further smears -F(x); therefore, the energy spectrum 
is definitely not the best place for determining F{x). The radiative transitions or the 
double differential distribution in the semi-leptonic decays are much better suited for 
this purpose. 

Below the end-point domain the primordial motion plays no role, and the physical 
photon spectrum is fully controlled by perturbative effects. The Sudakov exponent 
introduces modifications of order unity parametrically earlier than nonperturbative 
effects: the latter appear when (1 — {2E/mh)) ~ A/nib whereas the former come into 
play already at \n(mb/{mh — 2E)) ~ c ■ (ln(mb/A))-^/^. 

Moreover, one can account for the nonperturbative corrections outside the end- 
point domain A/nib ^ (1 — {2E/mb)) <^ 1, that is just where the perturbative 
spectrum does not vanish. It can be done exactly like in ref. ^ and the result will be 
a regular series in 1/ rrih because the perturbative spectrum is smooth in this domain. 
In particular, no linear corrections to the shape formally appear. 

It is interesting to notice that the power (non-perturbative) corrections in the 
domain _ 

A ^ ^ 
— < 1 < 1 

nib mb 

are calculable directly from eq. (|7^) by expanding dTl^^^/dE in Ay, without any 
further computations. The linear term vanishes due to eq. while the quadratic 
term is expressed through eq. (|56|), 



dTB ^ d^lT^ l_ i2MBy\B\QTi^Q\B). 
dE dE^ 24^ ^ ^' ' 

There are other corrections, of course, for instance, the aG term, but they are 

not enhanced by powers of (1 — {2E/mb))~^. 

It should be added that even if the amount of the Fermi motion were very small 
compared to A (narrow F{x)), the region close to Mb/2 still would be populated 
due to a totally different mechanism. The pick-up mechanism associated with the 
diagrams where a semi-hard gluon exchanges momentum between the decaying heavy 
quark and the light spectator produces such final states; it is important that this 
mechanism does not suffer from the same Sudakov suppression. This contribution 
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can be analyzed in a similar way as it has been done, for example, for the heavy 
flavor exclusive decays into pion or p |T^. It seems definite, however, that for real 
beauty hadrons this mechanism is subdominant. 

One more subtle point which deserves mentioning in connection with our discus- 
sion of the gluon radiative corrections. If one goes beyond the tree approximation a 
specific normalization procedure must be formulated in order to make all operators we 
work with well-defined. This procedure will clearly introduce a normalization point 
/X. The operators become /i dependent through anomalous dimensions, i.e. standard 
logarithms of p show up. Besides these standard logs a power dependence on p is 
also present (see e.g. |^). For instance, the matrix element (tt^) can contain a 



piece ~ as{p)p'^. In practice such terms are difficult to control, so they represent, 
rather, a theoretical uncertainty. For the OPE machinery to be workable one needs 
to assume that there exists a range of /i where two contradicting requirements are 
simultaneously met: (i) as{p) is sufficiently small so that QCD perturbation theory 
makes sense; (ii) the uncertainty due to a^/i^ is much less than the matrix element 
(vf^) itself. Experience with the QCD sum rules teaches us that the requirements can 
be actually met - an appropriate choice of p is possible (for a review see [p3[1). 

In this context the most sensible issue is the problem of an unambiguous definition 
of the parameter A. The value of A per se does not exist in QCD as a rigorously defined 
constant (for recent discussion see e.g. |^). In reality it is the dominance of the 
condensates (non-perturbative matrix elements of relevant operators) that enables 
one to ascribe more or less definite physical meaning to this parameter. For our 
purposes we can accept the following working definition: A can be understood as the 
distance between the center of the end-point peak and the kinematic boundary in 
B ^ 

The problem with the definition of A which might seem to be of a crucial im- 
portance, actually does not affect our analysis. Indeed, instead of A we could use 
any fixed hadronic scale, say the QCD scale parameter A (the one defining the run- 
ning coupling constant) in all constructions presented in Sects. 2, 3 and 4 and in all 
definitions of the scaling variables. 



6 Comments on the Literature 

A formal definition of the heavy quark distribution function inside the heavy hadron 



was first given in Ref. |T0[ in the context of deep inelastic electroproduction on heavy 
hadrons. The moments of this function (which can be called a 'light-cone' function) 
were related to the matrix elements of the operators ([531). It is important that the 
'light-cone' distribution function defined by eq. ( ^31) appears in this problem only 
provided one assumes that ^ uiq. 

The prime object of interest for these authors was the heavy quark fragmentation 
in heavy hadrons. They did not apply the OPE machinery discussed above to the 
heavy hadron decays. 



While our definition of F(x) coincides with that of Ref. HTUI at the tree level, the 



perturbative renormalizations will be drastically different. The normalization point 
appropriate to DIS on heavy mesons considered in [1^ is ~ ^ rng, i.e. academ- 
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ically high. In our case, due to different kinematics, the appropriate normahzation 
point is ~ mgA. 

A similar definition of the 'hght cone' distribution function was suggested in [|11 



for the description of the energy spectrum near the end-point in the semi-leptonic 
heavy flavor decays. The shape function of Ref. |]TT| is related to our F{x) through 
an integration. 

Our analysis goes further in the following aspects. First, we consider different 
processes to underline the universal nature of the distribution function F{x). The 
effect is shown to be much more pronounced in the radiative decays and in the double 
differential distributions in the semi-leptonic decays. 

Second, we explain that the 'light cone' distribution function, however important 
it might be, is by no means a unique characteristic of the heavy quark motion. Other 
quantities which are measurable, at least in principle, can depend on other distribution 
functions which reflect the heavy quark motion in a different way. A clear-cut example 
is the heavy quark decay into another heavy quark. If the mass of the quark produced 
is such that it is non-relativistic (the SV limit) the spectrum in this transition will 
measure a 'temporal' distribution function G{x) rather than the 'light-cone' one. 

Finally, we consider an interplay between perturbative and non-perturbative ef- 
fects which becomes especially non-trivial in the end-point domain. The Sudakov 
exponent introduces its own smearing which is to be disentangled from that due to 
the primordial motion. We have demonstrated how one can do this disentangling in 
the radiative transitions and in the double spectral distributions in the semi-leptonic 
decays. 



7 Limits on the Average Kinetic Energy of the Heavy Quark 

This section is related to the contents of the previous part in a somewhat indirect 
manner. We will consider here general bounds on the hadronic matrix elements of 
the operators built from heavy quarks and some number of vr^'s. The first non-trivial 
operator of this type is 

a = Q^^Q (91) 

and we will mainly focus on discussion of O-,^. The matrix element of (9^ over Hq is 
parametrized as 

^ {HqIQti'QIHq) ^ f^l (92) 



Below we will need also the matrix element of the operator Oq 

OG = Q'-cr^uG,.Q, ,^^{Hq\Og\Hq) ^ (93) 

It is clear that all operators of the type Q'K^TTy...Q are related to the moments an 
of the distribution function F{x). The positivity of the distribution function implies 
certain constraint on the matrix elements of these operators, much in the same way 
as the positivity of the light-quark distributions leads to constraints on the matrix 
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elements of the operators appearing in the OPE analysis of deep inelastic scatter- 
ing. These constraints are rather trivial. Perhaps, the most well-known example 
in deep inelastic scattering is the nucleon matrix element of the quark part of the 
energy-momentum tensor. The positivity of the distribution functions implies that 
the quark's share of the nucleon momentum is always less than one. In the case of 
the heavy quarks the constraints are weaker since the lower limit of variation of x 
is unknown a priori. Still, one can get an idea of the upper bound on /i^ if one 
assumes that —Xq is close to —1. There are absolutely no good reasons to beheve 
that —Xq = —1, still it is instructive to accept this assumptions and see what the 
consequences are. 

Were the function F{x) symmetric in x then we would automatically have —xq = 
— 1. We could then easily obtain an upper bound on fi^. Unfortunately, F{x) is by no 
means symmetric. The parameter xq introduced and discussed after eq. (|3lD is not 
equal to one. Moreover, perturbative effects enhance the tail at negative x. At the 
same time inclusion of these effects (given all our approximations and assumptions) 
does not change the fact that F(x) = at x > 1. Thus, strictly speaking, we can 
not get a rigorous upper bound on /i^. If for the purpose of orientation we assume 
for a moment that the distribution function is (nearly) symmetric and Xq = 1 then 
/ x'^dxF{x) < J dxF{x) = 1 which implies, in turn, the bound 

^ (vr^) < 3A\ (94) 



It is rather curious that the estimate of /i^ from the QCD sum rules nearly 
saturates this limit. 

One can obtain an upper limit on the second moment, however, with no further 
assumptions on Xq, provided the third moment is known from independent sources. 
For instance, one can use the fact that for all x < 1 one has < 1 + x — x^ that 
implies the inequality 

as < 1 - as. (95) 
Here we take into account that ai = 0. 



For an estimate of as we will rewrite eq. (|57|) in the following form 

]_J_(_2g'' 
6A' 2Mb V 9 



as ^ -::=3 ^ { — ^ ) {B\Hq\0){0\q^,b\B) = (96) 
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where D^G^q is substituted by the light quark current (the equation of motion) and 
then factorization is applied for the matrix element over the B meson state. Eqs. 



( p5| ) and (0) imply that 

{f^><3A%^/^. (97) 

The extra term in eq. ( pT)) is positive which is welcome in connection with the 
prediction p5| for (vr^) mentioned above. The numerical value of the extra term due to 



the third moment is rather uncertain. If g'^/4:7T is set equal to 1 then ((7^/18)/^Mb 
0.1 GeV^. The increase in the naive bound (B3) is quite modest. 
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{<){K)>-AB^)" (100) 



Let us proceed now to the discussion of the lower bound the matrix element of the 
operator O^r- Unhke the previous case a strict lower bound (strict in the quantum- 
mechanical sense) exists. 

The physical meaning of this matrix element is the average kinetic energy (more 
exactly, the spatial momentum squared) of the heavy quark Q inside Hq. This 
operator is spin- independent, and we are aware of no method of extracting its matrix 
elements from phenomenology, without invoking additional theoretical arguments or 
models. 

It is convenient to introduce a parameter similar to /i^ associated with Ot^, 

-l^{HQ\Qn'Q\HQ) = fil. (98) 

It is plausible that /i^ for mesons and baryons is different - there is no reason why 
they should coincide. 

For mesons a reliable lower bound on /i^ can be given from essentially a quantum 
mechanical consideration. Indeed, let us take into account the fact that 

[TTfcTT;] = -ieklnBn (99) 

where i?„ = —enpqGpq/2 is a chromomagnetic field. Then eq.(^) immediately leads 
to analog of the uncertainty principle for vTa,, tt^, namely 

1 

T 

where (...) means average over some hadronic state. It is convenient to use the 
polarized state of B* meson with 5*2 = 1 at this step. The average chromomagnetic 
field in this state can be presented as a matrix element of the operator Og, 

{B.) = -^^B*{S, = 1)\Og\B*{S, = 1)). (101) 

Accounting for the fact that the chromomagnetic field is proportional to the spin of 
the light cloud, B = kSic, one can relate the average ( |101| ) to the average over the B 
meson state {S = 0) and to the mass splitting between B* and B, 

{B.) = I ^{B\Og\B) = ^ = 1(M|, - M|) (102) 
It stems from three-dimensional rotational symmetry in the rest frame that 

{^D = = i^l) 

and, hence, 

{n')=fiUB)>^f,UB) (103) 

Numerically for B mesons fiQ ~ 0.35 GeV^ and then eq. ( [L03| ) implies that 
/i^ > 0.18 GeV^. Calculation |25| of this parameter from the QCD sum rules gives 
a value which is only three times larger than our lower bound. Namely, according to 
[H fxl = 0.54 ±0.12 GeV^. 

Consideration of possible constraints on /i^ from a somewhat different line of 



reasoning was recently presented in [26 
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8 Summary and Conclusions 



In this section we summarize our main results and findings. First and foremost, it 
is demonstrated that combining QCD with the heavy quark expansion provides us 
with a powerful tool for describing the motion of the heavy quark inside the heavy 
hadrons. The formalism emerging in this way is model independent and similar to 
that used in deep inelastic scattering. Technically our consideration is based on the 
notion of the transition operator T and Wilsonian operator product expansion. The 
corresponding ideas can be traced to Refs. |l|, |^ . 

The inclusive spectra and the decay rates in the semi-leptonic and radiative tran- 
sitions of the heavy fiavor hadrons are determined by universal distribution functions. 
The latter can be introduced through matrix elements of the relevant local operators, 
much in the same way it is routinely done in deep inelastic scattering. 

The description of distributions in inclusive heavy fiavor decays - lepton or photon 
spectra in semileptonic or radiative beauty decays, respectively - obtained in QCD 
proper would appear to be in obvious confiict with the data if the heavy quark Q were 
at rest inside the hadron Hq. For in the simple case of radiative B decays the photon 
spectrum would consist of a single line somewhat below the kinematical boundary. 
Intuitively it is obvious that Q moves inside Hq. The single photon line in radiative 
beauty decays then gets spread out by an amount of order A. We have pointed out 
in the past and shown in explicit detail in the present paper that this kind of 
Fermi motion - rather than being forced upon us by phenomenological constraints - 
finds a natural home in a rigorous QCD treatment. The salient features of the results 
obtained are: 

• The motion of the heavy quark inside a given hadron is described by a universal 
function; i.e., it is the same function that has to be folded with the primary lepton or 
photon spectra that arise from the heavy quark decay to obtain the non-perturbative 
corrections to the observable spectra, in particular in the end-point region. This 
universal function cannot be determined from perturbation theory - like it is for the 
structure functions in DIS. The universality of the heavy quark distributions take 
place only for a given value of the final quark mass. Changing this mass we pass to a 
different distrubition. In particular, two extreme cases have been considered in some 
detail, the massless final quark and the non-relativistic final quark. In the first case 
one has to deal with the 'light-cone' distribution function while in the second case it 
is the 'temporal' distribution function that is relevant. 

• Although the motion of the heavy quark is non-relativistic there exists no non- 
relativistic hadronic wavefunction that can reproduce the true effect of the Fermi 
motion beyond its second moment ((jjp)^). 

• A mo del- independent lower bound has been derived for ((vf)^) that is not more 
than a factor of three below what has been inferred from QCD sum rules [P^]. 

• The impact of perturbative gluon emission on the spectra is very considerable, 
in particular again in the end-point region. Incorporating both perturbative as well 
as non-perturbative corrections in a self-consistent way poses a highly non-trivial 
problem. Our treatment indicates that in beauty decays one can define and combine 
perturbative and non-perturbative corrections with sufficient practical precision. 
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Note added. Two publications devoted to the subject of the present work 
have just appeared p7| , We plan to comment on the results of in our 



forthcoming paper |^ 
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Figure captions 

Fig. 1. The spectrum in the transition Q ^ (pq for free quarks. 

Fig. 2. Smearing of the spectrum in Hq — > (f)Xq due to the heavy quark primordial 
motion in Hq. 

Fig. 3. The graph determining the transition operator in the Born approximation. 

Fig. 4. The graph giving rise to the correction due to the operator QqqQ in the 
transition operator. 
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